Skywork O1 Open PRM Qwen 2.5 1.5B
Other
Skywork o1 Open-PRM-Qwen-2.5-1.5B is a process reward model built on Qwen2.5-Math-1.5B-Instruct. It assigns rewards incrementally, scoring each intermediate reasoning step rather than only the final answer, and is designed to improve complex problem solving at a small model scale.
Large Language Model
PyTorch